Preprint IMAFF 98/01 



OSTROGRADSKI FORMALISM 
FOR 

HIGHER-DERIVATIVE SCALAR FIELD THEORIES 

by 

'66 . 

G\ ■ F.J. de Urries 

^ . 

and 
J.Julve (t) 

W Departamento de Fisica, Universidad de Alcald de Henares, 
C^l \ 28871 Alcald de Henares (Madrid), Spain 



> 

lO ' Serrano 113 bis, Madrid 28006, Spain 



IMAFF, Consejo Superior de Investigaciones Cientzficas, 



O 1 ABSTRACT 
00 ' 
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1. Introduction 

Theories with higher order Lagrangians have an old tradition in physics, and 
Podolski's Generalized Electrodynamics [1] (later visited as a useful testbed [2]), ef- 
fective gravity and tachyons [3] are examples. The interest in higher order mechanical 
systems is alive until today [4]. 

Theories of gravity with terms of any order in curvatures arise as part of the low 
energy effective theories of the strings [5] and from the dynamics of quantum fields in 
a curved spacetime background [6]. Theories of second order (4-derivative theories 
in the following) have been studied more closely in the literature because they are 
renormalizable [7] in four dimensions. They greatly affect the effective potential and 
phase transitions of scalar fields in curved space-time, with a wealth of astrophysical 
and cosmological properties [8]. In particular a procedure based on the Legendre 
transformation was devised [9] to recast them as an equivalent theory of second dif- 
ferential order. A suitable diagonalization of the resulting theory was found later [10] 
that yields the explicit independent fields for the degrees of freedom (DOF) involved, 
usually including Weyl ghosts. 

In [11] the simplest example of this procedure was given using a model of one 
scalar field with a massless and a massive DOF. In an appendix, Barth and Chris- 
tensen [12] gave the splitting of the higher-derivative (HD) propagator into quadratic 
ones for the 4th, 6th and 8th differential-order scalar theories. A scalar 6-derivative 
theory has been considered in [13] as a regularization of the Higgs model, yielding a 
finite theory. 

Classical treatises [14] study the Lagrangian and Hamiltonian theories of systems 
with a finite number of DOF and higher time-derivatives of the generalized coordi- 
nates. Later work has considered the variational problem of those theories with the 
tools of the Cartan form, k-jets, symplectic geometry and Legendre mappings [15]. 

However the particular case of relativistic covariant field theories has complica- 
tions of its own which are not covered by those general treatments. Our presentation 
highlights the Lorentz covariance and the particle aspect of the theory, with emphasis 
in the structure of the propagators and the coupling to other matter sources. We 
address this issue by using a simplified model with scalar fields as in [11] and [12], 
and our extrapolation of the canonical analysis to these continous systems validates 
the formal procedures introduced there. The analysis here presented mostly focuses 
on the free part of the Lagrangian, and self-interactions and interactions with other 
fields are embodied in a source term. 

In Section 2 we briefly review the Ostrogradski method and outline our exten- 
sion to the field theories. In Section 3 we study the case of the 4-derivative theory 
for arbitrary non-degenerate masses, which exemplifies the use of the Helmholtz La- 
grangian and the crucial diagonalization of the fields. The 8-derivative case and higher 
4iV-derivative cases are considered in Section 4. For even N the 2A^-derivative cases 
present some peculiarities that deserve the separate discussion of Section 5. Our 
results are summarized in the Conclusions. 

As a general feature, our procedure involves vectors with pure real and imaginary 
components as well as symmetric matrices with equally assorted elements. Diagonal- 
izing symmetric matrices of this kind is a non-standard task which is detailed in an 
Appendix. 
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2. The Ostrogradski's method. 



We consider a HD Lagrangian theory for a system described by configuration 
variables q(t) . By dropping total derivatives, it can be always brought to a 
standard form 

( m )^ 

L[q,q,q,-, q \ , (2.1) 

depending on time derivatives of the lowest possible order. The variational principle 
then yields equations of motion which are of differential order 2m at most: 

dL d dL . Nm d m dL 

l_ . . . _i_ ( _i ) m = n (22) 

dq dtdq 1 } dt m Q (rn) ■ 1 • } 

Hamilton's equations are obtained by defining m generalized momenta 

dL 



d q 
_ dL d 

Pi = Zti-dt Pi+1 (* = 1 »- TO - 1 ) ' 

dq 



(2.3) 



and the m independent variables 

Qi = Q 



Qi = Q (i = 2,...,m) 



(2.4) 



Then the Lagrangian may be considered to depend on the coordinates qi and only 

on the first time derivative q m = ^q^ .A Hamiltonian on the phase space [qi,Pi] 
may then be found by working q m out of the first equation (2.3) as a function 



qm[?l,-,9m;Pm] , ( 2 -5) 

the remaining velocities qi (i = l,...,m — 1) already being expressed in terms of 
coordinates, because of (2.4), as 

qi = qi+i ■ (2.6) 

Thus 



m m—1 

H [Qi,Pi] = ^2 Pi ^ { ~ L = +M«n - L[qi, ...g m ;qm] 



(2.7) 



Therefore 



m—l 



rn 



1=1 



but (2.3) can be written as 

dL 



n . — Pm 

OL 

= p i +p i _ 1 [1 = 2, ...,m) , 



and (2.2), because of (2.3), gives 

dL _ dL 
dqi dq 

so we get 



(2.8) 



(2.9) 



= Pi , (2-10) 



SH = Y^{~Pi 5( li + QiSpi) , (2.11) 



i=i 

and the canonical equations of motion turn out to be 

9H . dH 

dp; % 

Summarizing we may say that a theory with one configuration coordinate q obey- 
ing equations of motion of 2m differential order (stemming from a Lagrangian 

with quadratic terms in q as its highest derivative dependence) can be cast as 
a set of lst-order canonical equations for 2m phase-space variables [qi,Pi] ■ 

As it is well known, once the differential order has been reduced by the Hamilto- 
nian formalism, one may prefer to obtain the same canonical equations of motion from 
a variational principle. Then the canonical equations (2.12) are the Euler equations 
of the so-called Helmholtz Lagrangian 



L H [qi,qi,Pi] PtQi- H[qi,pi] (2.13) 

i=l 
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which depends on the 2m coordinates q± and pi , and only on the velocities 
qi . This alternative setting will be adopted later on. 

As far as finite-dimensional mechanical systems are considered, only time deriva- 
tives are involved. The generalized momenta above have a mechanical meaning and 
the resulting Hamiltonian is the energy of the system up to problems of positiveness 
linked to the occurrence of ghost states. 

Extension to field theories 

Continuous systems with field coordinates <f>(t, x) usually involve space deriva- 
tives as well, chiefly if relativistic covariance is assumed. We now generalize the pre- 
vious formalism to this case. A HD field Lagrangian density will have the general 
dependence 

C[<f>, </» M ,...> Ml ... Mm ] , (2.14) 
where = <9 Ml • • -<9 Mi , with corresponding equations of motion 

i- 8 "S + - +( - ir8 «- 8 '-^ =0 ■ (215) 

The generalized momenta now are 

dC 



<90 M1 ... 



Mr. 



tt" 1 -* = — a Mi+1 7r^-^^+ 1 (i = 1, m-1) 

C^Mi'-'Mi 



(2.16) 



Though they have not a direct mechanical meaning of impulses they still are suitable 
to perform a Legendre transformation upon. 

Assuming also that the highest derivative can be worked out of the first equation 
of (2.16) as a function Ml ... Mm [<f>, (f>^, 4>ni-Hm-i 5 7r Ml '"^] , the "Hamiltonian" 
density now is 



(2.17) 



Then the canonical equations are 



_ dU 

on 



(2.18) 



90 



90 Ml 



Ml ■"Cra-l 
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This general setting may be hardly applicable to systems of practical interest 
(generally involving internal symmetries and / or fields belonging to less trivial Lorentz 
representations) if suitable strategies are not adopted to refine the method. One cru- 
cial observation is that the momenta may be defined in more useful and general ways 
than the plain one introduced in (2.16): instead of differentiating with respect to the 
simple field derivatives </> Ml ... Mi one may consider combinations of field derivatives 
of different orders belonging to the same Lorentz and internal group representations. 
For instance, in HD gravity [9] , the Ricci tensor is a most suited combination of second 
derivatives of the metric tensor field. The only condition is that the Lagrangian be 
regular in the highest "velocity" so defined. This will be made clear in the following. 

In fact this general Ostrogradski treatment can be significantly simplified for the 
Lorentz invariant theory of a scalar field, which is the example we will consider in 
this paper. In this case, dropping total derivatives, the general form (2.14 ) can be 
expressed in a more convenient way that singles out the free quadratic part, namely 



£ = -_0[l][2]...[iV]0-j0 , (2.19) 

where [i] = (□ + mf ) , our Minkowski signature is (+,—,—,—) so that 
□ = df — A , and c is a dimensional constant. The masses are ordered such 
that rrii > rrij when % < j so that the objects (ij) = (mf —mj) are always 
positive when i < j . 

It turns out to be very advantageous to consider only Lorentz invariant combi- 
nations of derivatives of the type □ n and of the (p field itself with suitable 
dimensional coefficients. Further, it is even more useful to consider expressions of the 
form [i] n </> . 

Thus, arbitrarily focusing ourselves on i = 1 without loss of generality, equation 
(2.19) may be recast as 

1 N 

C = -J2^ll] n ^-J^ , (2-20) 

n=l 



where the c n are redefined constants. 

Calling m = ^ for even N , and 
equation now reads 



m = for odd N , the motion 



m ~„ N 

^ |1] "w) = ^ c " ,1I> = J (2 ' 21) 

n=l vu. ji t / n=l 

The Legendre transform can now be performed upon the simpler set of generalized 
momenta 
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TTm. = 



dC 



dC , , 

<9(M m V) (2.22) 



dC 

^ = liB) +[1,7rs+1 (« = l,.-.,m-2). 

The Hamiltonian will depend on the new phase-space coordinates 
iy[0i,...,0 m ;7Ti,...,7r m ] , where 0, = [l] i_1 . To this end [l] m </> has been 
worked out of the 1st (2.22) for even N , or of the 2nd (2.22) for odd N , in 
terms of these coordinates. 

The dynamics of the system is given by the 2m equations of second order 

M* = f 

(i=l,...,m) . (2.23) 

Notice that, in comparison with (2. 12), (2. 16) and (2.18), no negative sign occurs in 
both (2.22) and (2.23), because each step now involves two derivative orders. 

As a final comment, the treatment followed above keeps Lorentz invar iance ex- 
plicitely, and this will turn advantageous later on. The price has been that neither 
the 7r's have the meaning of mechanical momenta nor H has to do with the 
energy of the system. However they are adequate for providing a set of " canonical" 
equations that correctly describe the evolution of the system. Moreover, these equa- 
tions are Lorentz invariant and of 2nd differential order, which will lend itself to an 
almost direct particle interpretation. 



One may however choose to work with the genuine Hamiltonian and mechanical 
momenta obtained when the Legendre transformation built-in in the Ostrogradski 
method involves only the true "velocities" d™(j) . The price now is loosing the 
explicit Lorentz invariance and facing more cumbersome calculations, as we will see 
by an example in the 2nd part of the next Section. 



3. N=2 theories. 

These theories allow a particularly simple treatment that will be illustrated in 
the examples N = 2 and N = 4 . The equations (2.23) for N = 2 will 
now be obtained from a Helmholtz-like Lagrangian of 2nd differential order, which is 
closer to a direct particle interpretation. 
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Consider the N = 2 Lagrangian 



£ 4 = -^0[lJ[2]0-.7 ■ (3.1) 

with non-degenerate masses mi > ■ Taking the dimensional constant 
M = (ml — m|) = (12) > , equation (3.1) yields the propagator 

(12> - 1 L , (3-2) 



W [2] II] [2] 

We thus see that the pole at mi then corresponds to a physical particle and the 
one at m\ to a negative norm "poltergeist". The 2nd order Lagrangian we are 
seeking should describe two fields with precisely the particle propagators occurring in 
the r.h.s. of (3.2). 

The Lagrangian (3.1) can be brought to the form (2.20), namely 



(3.3) 



£ 4 [0, [1]0] = -I J_[0[lf 0- (12)0[l]0]-j0 

= -^P)[(w^) 2 -( 12 )^w^)]-^ ' 

where the relationship [2] = [1] — (12) has been used. 
We define one momentum 

dC . . 

from which is readily worked out, obtaining 

n^n] = -^(12)(-n + ^) 2 + J( p (3.5) 
and the Helmholtz-like Lagrangian is 

£|,[0,[l]0,7r] = 7r[l]0-W[0,7r] . (3.6) 

It contains mixed terms % <fi that obscure the particle contents. The diagonaliza- 
tion is achieved by new fields 4>i , 02 

4> = 0i + 4>2 

i (3.7) 
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to yield 



C 2 = 1^1110! -I0 2 [2]0 2 - j (0!+0 2 ) , (3.8) 

where the particle propagators in the r.h.s. of (3.2) are apparent. This result is 
physically meaningful: where we had a single field , coupled to a source j , 
propagating with the quartic propagator in the l.h.s. of (3.2) as implied by the 
HD Lagrangian (3.1), we now have two fields 0i , 2 describing particles with 
quadratic propagators, and the source couples to the sum 0i + 2 . 

A deeper insight of the phase-space structure of the theory can be achieved by 
the plain use of the Ostrogradski method, eventually confirming the final form (3.8). 
In order to explicitely show the velocities, we write (3.1) in the form of the Lagrangian 
density 

£4 = ~\ (iV (<9 * 2</,)2 " {dt<t>)s{dt<t>) +0p0} ~ h (3 - 9) 

where S = M 2 +M 2 , P = M 2 M 2 and Mf = m 2 — A are operators containing 
the space derivatives. 

The Ostrogradski formalism yields the Hamiltonian density 



H 4 [0, 0; TTx, n 2 ] = --(12)n 2 + 7n0 - - W + -^y^ + j <f> (3.10) 

that depends on the phase-space coordinates , > > ^2 and on their space 
derivatives. The highest-order "velocity" d 2 (p has been worked out of the mo- 
menta 



dC 4 1 

(3.11) 



^-^10) (12) a * 



<9£ 4 
<9(<9t0) 

The canonical equations may be derived from the Helmholtz Lagrangian 



£^[0,0;7ri7r 2 ;a t 0,a t 0] = 7r 2 <9 t + ^xd t 4> + ^(12)^ - 7Ti0 + ^T^&S^ 

1 1 (3-12) 
-2(12)^"^ • 
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This is a Lagrangian density of 1st order in time derivatives, and we express it in 
matrix form for later convenience: 



(3.13) 



where fx is an arbitrary mass parameter and 





( T ^ 


( 


$ = 








// _1 7ri 






V d> ) 


V 



/(12) 



V o 







M 2 g 
<12> 










1 
-1 



\ 



I 











1 

-1 o/ 



J 



(12) 








(3.14) 



with mass dimensions [<&] = 1 , [.M4] = 2 and [J] = 3 . 

In order to relate (3.13) to (3.8), we have to convert the latter into a 1st order 
theory as well. This is readily done by expressing the velocities d t 4>i and d t 4>2 
in terms of the momenta 



dC 2 

n i = 0/0 , \ = -dt<t>i , 



7T 2 = 



dC? 

d{d t <h) 



d, 



W2 



(3.15) 



so that 



H 2 [</> u 02,7fi,7f 2 ] = -\*l + -^iM 1 2 </, 1 + i ( /, 2 M 2 2 (/) 2 +j(0 1 +^ 2 ) . (3.16) 



The Helmholtz Lagrangian that yields the canonical equations now is 



c 2 H = —Q T /j e ds + -e T M 2 q-j t zq 



(3.17) 



where 



e = 



/At _i 7fi\ 
01 

fl 1 7T 2 
V </>2 / 



M 2 = 





M\ 


V 










\ 




-Ml) 



(3.18) 
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with mass dimensions [6] = 1 and [M.2] = 2 , and Z is any matrix with the 
fourth row equal to (0, 1, 0, 1) . 

The field redefinition analogous to the diagonalizing equations (3.7) now is a 
4x4 mixing of fields given by 



$ = xe 



(3.19) 



where the invertible matrix 



X 



( 
-1 

Mi 



2) 



T12T 




1 





1 

Ml 



<12> 







1 / 



(3.20) 



verifies 



X EX = E 



(3.21) 



X T M 4 X = M 2 (3.22) 

so we can identify Z = X . 

We thus see that (3.19) translates (3.13) into (3.17), and therefore the La- 
grangians (3.9) and (3.8) are again seen to be equivalent. The derivation of the matrix 
X is cumbersome but contains interesting details that worth the Appendix. Notice 
that the components of $ are expressed by (3.19) in terms of the components of 
and of their space derivatives. This is not surprising as long as 7Ti , given by 
(3.11), contains space derivatives of 4> as well. 

Though the plain non-covariant Ostrogradski method we have just implemented 
eventually shows up the Lorentz invariance, the readiness of the explicitely covari- 
ant procedure formerly introduced in this Section is apparent. The non-covariant 
approach using the canonical Hamiltonian and mechanical momenta is rigourous and 
validates the former, but involves more bulky diagonalizing matrices with elements 
that contain space derivatives. 

4. N=4 and higher even N theories 

We treat the N = A Theory with the far more practical Lorentz invariant 
method of the previous Section. Otherwise one would have to face the diagonal- 

ization of 8x8 matrices analogous to M 2 and M 4 in Appendix A. Our 
Lagrangian now is 

/:8 = "^ </,IllI2lI3lI4l</, "^ (ZL1) 
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where the mass dimensions [fj] = [0] = 1 , [M] = 12 and [j] = 3 are such 
that [£ 8 ] = 4 . Taking M = (12) (13) (14) (23) (24) (34) , equation (4.1) treats the 
masses rrii (i = 1, ...,4) on an equal footing, which is apparent in the propagator 



V~ 6 M (1)_(2) (3)_(4) 
[1][2][3][4] [1] [2] + [3] [4] 1 • } 

where (i) = fi~ 6 M Y\ jfjj (remind the ordering convention i < j ) with mass 

3 

dimensions = . 

As for (3.2), the propagator expansion (4.2) suggests that the lower-derivative 
equivalent theory should now be 



- + 02 + </>3 + ^4) • 

We derive this Lagrangian from (4.1) in the following. In matrix form, (4.3) reads 

C 2 = \t t {1\It+ 1 -t t M 2 t- J t Ft , (4.4) 

where 



r 









/ 


-z(2)-50 2 
(3)-i0 3 


, J = 






, M 2 = 


V-z(4)"W 




W 


\ 











-(12) 








-(13) 





, (4.5) 



I is the 4x4 identity, and F is any matrix with the fourth row equal to 

«l>*,*<2>i,<3>i,i<4>4). 

By dropping total derivatives we express (4.1) in a standard form involving deriva- 
tives of the lowest possible order, namely 



£ 8 [0, [1J0, [1] 2 0] = -^{(WV) 2 - S([1]0)([1] 2 0) +P([1]0) 2 

-P0([l]0)}-j0 , 



(4.6) 



where S = (12) + (13) + (14) , p = (12) (13) + (12) (14) + (13) (14) , and 
P= (12) (13) (14). 
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Ostrogradski-like momenta are denned as follows 



7T2 = 



1"! 



" +Uh2 



= -C(Il] 2 0) + ^W 



2M 



(4.7) 



From the 1st of (4.7) the highest derivative is worked out, namely 



[l] 2 ^[7r 2 ,[l]0] = -^7r 2 + |([l]0) . 



(4.8) 



The " Hamiltonian" functional is 



n\M2^iM = 7 r 2 [l] 2 ( /> + 7 r 1 ^ 2 -£ 8 [^ 1 ,V2,[l] 2 </>] , (4.9) 

. Its canonical equations can be derived from the 



where ipi = and V2 
Lagrangian 



where J is the same as in (4.5), 



(4.10) 



/ ^ „ 2 \ 

2 



7Tl 



/ n~ 10 M 



M 8 = 



K, 





1 





°\ 


1 




















1 


Vo 





1 


0/ 



and 



2 



s 
2 



■(p- 



\ 

,2 



V 








M I 
,,2 



-/r 





2(1) 





2<T) / 

Prior to its diagonalization we write (4.10) in the form 



(4.11) 



where O = (T ,T )~ 1 $ , with 



(4.12) 



V = 



1 

71 



/ 1 


1 





°\ 


—i 


% 














1 


1 


V 





— % 


i/ 



(4.13) 
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and 



M 8 = VM 8 V 



-l 



1 

2 



/M_ - 5 
-iM+ 

V ^ 2 1+ 



-iM 



+ 



-(M_ + 5) 
-z/x 2 l_ 



2 1_ 



-,u 2 l 








/ 



(4.14) 



with M± = ^±^(p-f) and 1± = 1±2^_. 

Now the task is to establish the equivalence of (4.12) and (4.4). One may first 
check that the eigenvalues A* (i = 1, 4) of Ms are the diagonal elements of 
M2 in (4.5). The orthogonal matrix T that diagonalizes Ms is obtained 
by working out its orthonormal eigenvectors | A^) with the suitable sign, and 
arranging them as the columns. These are 



Ai) = 



A, 



(1)1 
V2 



( \ 


1+ 



or 



+ (ljK2(lj)-S-M.)]\ 



V2[-^M + 2(1 j)-S] 



^[-^ + (lj)M + ] 



l_[-2^- 10 M + 2(lj) -S] 
V -zl + [-2^- 10 M + 2(lj)-^] J 



(4.15) 



where j = 2,3,4 . If / is the identity matrix, we therefore have 



T T IT=I , T T M 8 T = M 2 , 



(4.16) 



and the fourth row of V T T can be seen to be ((1) 2 , i(2) 2 , (3) 2 , i(A) 2 ) , i.e. it 
has the required form for F . Then, by taking O = Tr , (4.12) is identical to 
(4.4). 



The general case for even N > 6 in the covariant treatment would involve 
Y Ostrogradski-like momenta and the diagonalization of a N x N mass ma- 
trix. The non-covariant Ostrogradski method introduced in Section 3, which reduces 
the theory to a 1st differential-order form, would now involve 2N x 2N matri- 
ces. In both treatments the procedure would follow analogous paths, albeit with the 
occurrence of intractable eigenvector and diagonalization problems. 
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5. N=3 and higher odd N theories. 

For N = 3 , the HD Lagrangian 

c = -\^mimm-3^ , (5.1) 

where M = (12) (13) (23) and [£ 6 ] = 4 , yields the propagator 

V~ 2 M /x~ 2 (23) /x~ 2 (13) /x~ 2 (12) 
I1M2M31 " 111 + 121 131 ' 1 J 



Then, the expected equivalent 2nd-order theory is 



C 2 = -\-^jM^<h + Ij^faPlfa - ^03[3]0 3 - j(<h + ^ + <h) ■ (5.3) 

Already for N = 3 , the non-covariant Ostrogradski method becomes exceed- 
ingly cumbersome. In fact, it reduces both (5.1) and (5.3) to 1st differential order in 
time. Proving the equivalence of those theories then involves the diagonalization of 
6x6 matrices (the counterpart of M.^ and M.2 in (A. 4)), although with a 
reasonable amount of work it can still be checked that both mass matrices have the 
same eigenvalues, namely ±/zMi , ±/iM 2 and ±/uM 3 . Finding the eigen- 
vectors and building up the compound diagonalizing transformation does not worth 
the effort. 

For the odd N theories, the covariant method exhibits an interesting feature. 
Without loss of generality we again single out the Klein-Gordon operator [1] and 
write (5.1) as 



£ 6 [^, [110, [1F0] = -^{(]i]0xji] 2 0) - s(jiw + pmm - j * , (5-4) 

where now S = (12) + (13) and P=(12)(13). 
The momenta are 

dC 6 1 fx 2 



(5.5) 
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Unlike in (4.7), the highest derivative now is worked out of tt\ (instead of 7r 2 ), 
namely 



M 



(5.6) 



and, in terms of the coordinates tt\ , 7r 2 , ipi = 4> an d ^2 = [1J0 , the "Hamilto- 
nian" reads 



U 6 [^U fa, 7T1, 7T 2 ] = 7T 2 [l] 2 </> + 7Tl^2 " ^l, V>2, [l] 2 0] 

The Helmholtz Lagrangian is 



(5.7) 



M 



C H [lpl, ^2,^1,^2] = ^[l]^ +7Tl[l]V'l + ^-7Ti7r 2 - 5772^2 + -P^2^1 



/7/ 



1 u 2 



(5.8) 



The distinctive feature of the odd N cases is that the 1st of (5.5), namely 

2 

7r 2 = —\j4^2 1 is a constraint that guarantees the relationship [ljV'i = V>2 > so one 
just has N degrees of freedom. For even N it arises directly as an equation 
of motion. Moreover, unlike the Dirac Lagrangian for spin-i fields or the constraints 
introduced by means of multipliers, the constraint above can be freely imposed on 
the Lagrangian since it does not eliminate the dependence on the remaining variables 
ifji and 7Ti . Thus, (5.8) can be expressed in terms of only the three fields Vi ? 
TTi and 7r 2 : 



(5.9) 



where 



K! 



M 3 = 




(5.10) 
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The Lagrangian (5.9) is expected to be equivalent to (5.3), which in matrix form reads 



^ = -\r T mir + \r T M' 2 r-J T Gr 



(5.11) 



where I is the 3x3 identity matrix, 



/ ^23) 
i//(13)-3 

V ^(12) 



n 

2 6 2 



M' 




(5.12) 



and G is any matrix with the third row given by (// 1 (23) 2 , — i^i 1 (13) 2 , ^ 1 (12) 2 ) . 
The transformation of (5.9) into (5.11) is performed by the field redefinition 

$ = V'Tt , (5.13) 

where 



V EE 




M 3 





\/2M 





-% -1 





-% 1 



(5.14) 



and T is an orthogonal matrix built up with the eigenvectors of V' T M3V 
namely 



( i^<12)i -2^(13>i \ 



2^ 



(23)" 



.p_ 



\ p 



3 + 



(12)- 



*4fe<i3>- 



p_ 



(5.15) 



(13)- V 



with P± ee P ± fi~ 2 2M . 

Then V' T K.'V = -I and T T V' T M 3 V'T = M' 2 . One may also check that 
P'T has the same third row required for G . 

The covariant treatment of the general odd N > 5 case proceeds along hte 
same lines. Initially (N+l)/2 Ostrogradski coordinates plus the corresponding 
momenta occur. Again the definition of the highest momentum yields a constraint 
with the same meaning as above, while the highest field derivative is worked out of 
the next momentum definition. Then one faces the diagonalization of a Helmholtz 
Lagrangian depending on just iV fields. 

Already in the = 3 case one might have chosen not to implement the 

constraint on the Lagrangian (5.8) and let it to arise in the equations of motion. 
These equations are the canonical ones for the Hamiltonian (5.7) and involve an even 
number of variables, as required by phase space. Thus one keeps the dependence of 
the Lagrangian (5.8) on the four fields ipi , ip2 , 7Ti and 7r2 . Notwithstanding 
this enlarged dependence, it may still be diagonalized by new fields <j>i , <j>2 , 03 
and ( , the (expected) surprise being that ( does not couple to the source j . 
It is a spurious field, which moreover vanishes when the constraint is implemented. 
We skip here the details of this derivation. 



17 



6. Conclusions 

We have shown the physical equivalence between relativistic HD theories of a 
scalar field and a reduced 2nd differential-order counterpart. The free part of the HD 
scalar theories can always be brought to the form (2.19) integrating by parts, and the 
only limitation of our procedure is the non-degeneracy of the resulting Klein-Gordon 
masses, i.e. we consider regular Lagrangians. The existence of a lower-derivative 
version is already suggested by the algebraic decomposition of the HD propagator 
into a sum of second-order pieces showing (physical and ghost) particle poles. The 
order-reducing program we have developed relies on an extension of the Legendre 
transformation procedure, on the use of the modified action principle (Helmholtz 
Lagrangian) and on a suitable diagonalization. A basic ingredient of this program is 
the Ostrogradski formalism, which we have extended to field systems. 

Two approaches have been worked out. The first one follows Ostrogradski more 
closely by defining generalized momenta and Hamiltonians with a standard mechanical 
meaning, at the price of treating time separately and loosing the explicit Lorentz 
invariance. It validates a second and more powerful one which is explicitly Lorentz 
invariant. The rigorous non-invariant phase-space analysis strongly backs also the 
formal covariant methods used in HD gravity, where R^ u [g^ dg, d 2 g] and dh^ 
(in the linearized theory) are used in the Legendre transformation. 

The HD theories of a scalar field we have considered are generalized Klein-Gordon 
theories, and hence of 2A differential order according to the number A of KG 
operators involved. While the non-invarint approach treats all the theories on the 
same footing, the odd A and the even A cases feature qualitative differences 
in the invariant method. Also the ratio of physical versus ghost fields varies. For even 
A one finds A/2 fields of each type. For odd A one has (A — 1)/2 ghost 
(physical) and (A + l)/2 physical (ghost) fields according to the overall negative 
(positive) sign of the free part of the HD Lagrangian. The squared masses may be 
shifted by an arbitrary common ammount, since only their differences are involved in 
the procedure. Then any of them may be zero (only one in this case), or tachyonic. 

On the other hand, the non-invariant procedure gets exceedingly cumbersome 
already for A = 3 , in contrast with the (more compact) invariant one which 
remains tractable up to A = 4 at least. Both approaches are applicable to higher 
A , only at the prize of increasing the length of the calculations (namely analitically 
diagonalizing Ax A matrices). An intriguing feature of the odd A cases when 
treated with the invariant method is the occurrence of a constraint on an otherwise 
overabundant set of Ostrogradski-like coordinates and momenta, together with a less 
conventional way of working out the highest field derivative. Ignoring the constraint 
causes the appearance of a spurious decoupled scalar field. 

Acknowledgements 

We are indebt to Dr. J. Leon for the careful reading of the manuscript and useful 
suggestions. 



18 



Appendix 

The problem of finding a matrix X with the properties (3.21) and (3.22) can 
be brought to the one of diagonalizing a symmetric 4x4 matrix with pure real and 
imaginary elements. The procedure is somehow tricky since there is no similarity-like 
transformation that brings the symplectic matrix E to the identity matrix, thus 
preventing a plain use of the weaponry of orthonormal transformations. We introduce 
the diagonal matrices / = diag(i, 1, 1, —i) and g = diag(l, i, i, —1) so that 



E = gKf , where K 



1 1) 1 I) l) \ 

10 

1 

Vo 1 0/ 



(A.l) 

Taking / j=g does not compromise the uniqueness of the transformation $ — > 
as shown at the end. 

Now we transform the symmetric matrix K into the 4x4 identity by a 
similarity transformation 



V = — 



1 

3 



/ 1 1 0\ 



V 



-% i 
11 

-i i. 



(A.2) 



so that 



VJCV T =Vg- 1 Ef~ 1 V T = I 



This same transformation converts Ma and M.2 into 



(A.3) 



M 4 = Vg- 1 Mif- 1 V T 
M 2 = Vg- 1 M 2 r 1 V T 



(A.4) 



Notice that M. 2 and M.^ are symmetric as well. This is a consequence of the 
vanishing of some critical elements in both matrices. One then verifies that they have 
the same eigenvalues, namely —ijiM\ , ijiM\ , i/jM 2 and —i[iM 2 , so that 
there exist orthogonal matrices R and T such that 
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T T M±T = R T M 2 R = in diag(-M u M u M 2 , -M 2 ) , 



while conserving the euclidean metric I 



RIR = TIT = I 



With the orthonormal eigenvectors as columns one obtains 



R 



2^/JI 



f-Ri 

-iR± 


V 



where 



Rf 



and 



2<12>V£ 



where 



-iR^ 
Rf 







Mj±(i 
s/M~, 



/T+ 
iT x ~ 

Pi 
\iP+ 



-iTf 
Tf 
iP+ 



o 
o 

Rt 

iRo 



\ 


-Rf) 



-T 2 ~ 
-iT+ 
P + 
iPo 



IP- 

-Po + / 



(A.5) 



(A.6) 



(A.7) 



(A.8) 



(A.9) 



Tf = y/ Mi(fly/ Mi ±(12)) 



pr = 



(\2)y/W i( P 



Mi 



(A.10) 

Notice that one has pure real and imaginary matrix elements and vector components, 
and that the norm of a vector, defined as | V |= V T V , may be imaginary as 
well. Since Mf = ml — A , a regularization (the dimensional one, for instance) is 
understood such that R and T have well defined elements. 
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Finally, from (A. 4) and (A. 5) one gets that 



YM 4 W = M 2 , 

(A.ll) 

where W = f~ x V T T R T V~ xT f and Y = gV~ x RT T Vg- x . The matrix W 
has some imaginary elements and the fourth row is not (0101) , so that it is 
not suitable to relate the real vectors $ and as in (3.19) yet. Moreover, 
Y ^W T . However one may check that 



(i 0\ 

% 

10 

Vo o o 1/ 



Y = X T , where X 



W 



(-% 0\ 

-i 

10 

V o o o 1/ 



is the matrix given in (3.20), so that (A.ll) writes 



(A.12) 



X T M±X = M 2 ■ 

(A.13) 

Furthermore, from (A. 3) and (A. 6) one has that 



X T EX = E . 

(A.14) 

The fourth row of X has the desired elements (0101) only if suitable signs 
are chosen for the eigenvectors that build up R and T , so that the handedness 
of the frame is conserved by X . We stress that X is also well-defined as 
a differential operator, and that the regularization is needed only for defining the 
intermediate operators T and R T . At the end of the process the regularization 
can be put off. 
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